AITopics | second-order update

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Neural Information Processing Systems | Mar-17-2026, 13:31:43 GMT

Online kernel learning (OKL) is a flexible framework to approach prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces can contain an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $\mathcal{O}(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $\mathcal{O}(t)$ per-step complexity. Second-order methods get closer to the optimum much faster, suffering only $\mathcal{O}(\log(T))$ regret, but second-order updates are even more expensive, with an $\mathcal{O}(t^2)$ per-step cost. Existing approximate OKL methods try to reduce this complexity either by limiting the Support Vectors (SV) introduced in the predictor, or by avoiding the kernelization process altogether using embedding.

artificial intelligence, machine learning, proceedings, (4 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Neural Information Processing Systems | Nov-21-2025, 14:52:33 GMT

Online kernel learning (OKL) is a flexible framework to approach prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces can contain an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $\mathcal{O}(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $\mathcal{O}(t)$ per-step complexity. Second-order methods get closer to the optimum much faster, suffering only $\mathcal{O}(\log(T))$ regret, but second-order updates are even more expensive, with an $\mathcal{O}(t^2)$ per-step cost. Existing approximate OKL methods try to reduce this complexity either by limiting the Support Vectors (SV) introduced in the predictor, or by avoiding the kernelization process altogether using embedding.

adaptive embedding, efficient second-order online kernel learning, name change, (3 more...)

Neural Information Processing Systems

Industry: Retail > Online (0.43)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.39)

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Neural Information Processing Systems | Nov-21-2025, 07:12:38 GMT

Related work: Although first-order OKL methods cannot achieve logarithmic regret, many approximation methods have been proposed to make them scale to large datasets.

artificial intelligence, machine learning, pro-n-kon, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Germany > Brandenburg > Potsdam (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Instructional Material > Online (0.41)

Industry:

Education > Educational Setting (0.46)
Retail > Online (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Neural Information Processing Systems | Oct-3-2024, 04:20:45 GMT

Online kernel learning (OKL) is a flexible framework for prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces often contains an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $\mathcal{O}(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $\mathcal{O}(t)$ per-step complexity.

algorithm, effective dimension, pro-n-kon, (13 more...)

Neural Information Processing Systems

Country:

North America > United States > California > Los Angeles County > Long Beach (0.04)
Europe > Germany > Brandenburg > Potsdam (0.04)
Europe > France > Hauts-de-France > Pas-de-Calais (0.04)

Genre: Instructional Material > Online (0.61)

Industry: Retail > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Machine Unlearning of Features and Labels

Alexander Warnecke, Lukas Pirch, Christian Wressnegger, Konrad Rieck

arXiv.org Artificial Intelligence | Aug-7-2023

Removing information from a machine learning model is a non-trivial task that requires partially reverting the training process. This task is unavoidable when sensitive data, such as credit card numbers or passwords, accidentally enter the model and need to be removed afterwards. Recently, different concepts for machine unlearning have been proposed to address this problem. While these approaches are effective in removing individual data points, they do not scale to scenarios where larger groups of features and labels need to be reverted. In this paper, we propose the first method for unlearning features and labels. Our approach builds on the concept of influence functions and realizes unlearning through closed-form updates of model parameters. It makes it possible to adapt the influence of training data on a learning model retrospectively, thereby correcting data leaks and privacy issues. For learning models with strongly convex loss functions, our method provides certified unlearning with theoretical guarantees. For models with non-convex losses, we empirically show that unlearning features and labels is effective and significantly faster than other strategies.
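
To illustrate what a closed-form unlearning update can look like, here is a minimal sketch on ridge regression. It is not the authors' method: the function names (`fit_ridge`, `unlearn_feature`), the choice of ridge regression, and the use of the Hessian of the corrected objective are all assumptions made for illustration. Because the ridge objective is quadratic, a single Newton step from the old parameters lands exactly on the retrained solution; for other losses such a step would only be an approximation.

```python
import numpy as np

def fit_ridge(X, y, lam):
    # Closed-form ridge regression: argmin 0.5*||X theta - y||^2 + 0.5*lam*||theta||^2.
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)

def unlearn_feature(theta, X, y, lam, rows, col):
    # Hypothetical helper: zero out feature `col` in the given rows, then
    # correct theta with one Newton step instead of retraining from scratch.
    X_new = X.copy()
    X_new[rows, col] = 0.0
    # theta minimizes the old objective, so the gradient of the new objective
    # at theta is the gradient difference summed over the changed rows only.
    grad_old = X[rows].T @ (X[rows] @ theta - y[rows])
    grad_new = X_new[rows].T @ (X_new[rows] @ theta - y[rows])
    # Hessian of the corrected (post-removal) ridge objective.
    H_new = X_new.T @ X_new + lam * np.eye(X.shape[1])
    theta_new = theta - np.linalg.solve(H_new, grad_new - grad_old)
    return theta_new, X_new

# Usage: remove feature 2 from the first five training rows.
rng = np.random.default_rng(0)
X = rng.normal(size=(40, 4))
y = X @ np.array([1.0, -2.0, 0.5, 0.0]) + 0.1 * rng.normal(size=40)
theta = fit_ridge(X, y, lam=1.0)
theta_u, X_u = unlearn_feature(theta, X, y, 1.0, rows=np.arange(5), col=2)
```

The gradient term touches only the affected rows, which is what keeps this kind of update cheap when few points change. The Hessian solve still costs time cubic in the number of features.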

artificial intelligence, deep learning, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2108.11577

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New York (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Structured second-order methods via natural gradient descent

Wu Lin, Frank Nielsen, Mohammad Emtiyaz Khan, Mark Schmidt

arXiv.org Machine Learning | Jul-22-2021

In this paper, we propose new structured second-order methods and structured adaptive-gradient methods obtained by performing natural-gradient descent on structured parameter spaces. Natural-gradient descent is an attractive approach to design new algorithms in many settings such as gradient-free, adaptive-gradient, and second-order methods. Our structured methods not only enjoy a structural invariance but also admit a simple expression. Finally, we test the efficiency of our proposed methods on both deterministic non-convex problems and deep learning problems.

second-order method, structured second-order method, tri-low, (12 more...)

arXiv.org Machine Learning

2107.10884

Country:

North America > Canada > British Columbia (0.04)
North America > Canada > Alberta (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.67)

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Neural Information Processing Systems | Feb-15-2020, 19:42:24 GMT

Online kernel learning (OKL) is a flexible framework to approach prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces can contain an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $\mathcal{O}(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $\mathcal{O}(t)$ per-step complexity. Second-order methods get closer to the optimum much faster, suffering only $\mathcal{O}(\log(T))$ regret, but second-order updates are even more expensive, with an $\mathcal{O}(t^2)$ per-step cost. Existing approximate OKL methods try to reduce this complexity either by limiting the Support Vectors (SV) introduced in the predictor, or by avoiding the kernelization process altogether using embedding.

adaptive embedding, efficient second-order online kernel learning, second-order update, (1 more...)

Neural Information Processing Systems

Genre: Instructional Material > Online (0.64)

Industry: Retail > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.44)

Efficient Second-Order Online Kernel Learning with Adaptive Embedding

Daniele Calandriello, Alessandro Lazaric, Michal Valko

Neural Information Processing Systems | Dec-31-2017

Online kernel learning (OKL) is a flexible framework to approach prediction problems, since the large approximation space provided by reproducing kernel Hilbert spaces can contain an accurate function for the problem. Nonetheless, optimizing over this space is computationally expensive. Not only do first-order methods accumulate $\mathcal{O}(\sqrt{T})$ more loss than the optimal function, but the curse of kernelization results in an $\mathcal{O}(t)$ per-step complexity. Second-order methods get closer to the optimum much faster, suffering only $\mathcal{O}(\log(T))$ regret, but second-order updates are even more expensive, with an $\mathcal{O}(t^2)$ per-step cost. Existing approximate OKL methods try to reduce this complexity either by limiting the Support Vectors (SV) introduced in the predictor, or by avoiding the kernelization process altogether using embedding. Nonetheless, as long as the size of the approximation space or the number of SV does not grow over time, an adversary can always exploit the approximation process. In this paper, we propose PROS-N-KONS, a method that combines Nyström sketching to project the input point into a small, accurate embedded space, and performs efficient second-order updates in this space. The embedded space is continuously updated to guarantee that the embedding remains accurate, and we show that the per-step cost only grows with the effective dimension of the problem and not with $T$. Moreover, the second-order updates allow us to achieve logarithmic regret. We empirically compare our algorithm on recent large-scale benchmarks and show that it performs favorably.
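
The idea of embedding points with a Nyström map and then running a second-order update in the embedded space can be sketched as follows. This is a simplified illustration under stated assumptions, not PROS-N-KONS itself: the dictionary is fixed rather than adaptively updated, the loss is the squared loss, and the second-order step is plain recursive least squares (online ridge regression). The names `rbf_kernel`, `nystrom_map`, and `online_ridge` are hypothetical. The point it shows is that, once the embedding has `m` dimensions, each step costs $\mathcal{O}(m^2)$ regardless of how many points have been seen.

```python
import numpy as np

def rbf_kernel(A, B, gamma=1.0):
    # Pairwise Gaussian kernel between the rows of A and the rows of B.
    sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2.0 * A @ B.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def nystrom_map(dictionary, gamma=1.0, jitter=1e-10):
    # Nystrom embedding phi(x) = K_DD^{-1/2} k_D(x) for a FIXED dictionary D
    # (assumption: the method in the abstract updates its embedding over time).
    K = rbf_kernel(dictionary, dictionary, gamma)
    w, V = np.linalg.eigh(K + jitter * np.eye(len(dictionary)))
    inv_sqrt = V @ np.diag(1.0 / np.sqrt(np.maximum(w, jitter))) @ V.T
    return lambda X: rbf_kernel(np.atleast_2d(X), dictionary, gamma) @ inv_sqrt

def online_ridge(X, y, phi, alpha=1.0):
    # Recursive least squares in the embedded space: O(m^2) work per step.
    m = phi(X[:1]).shape[1]
    A_inv = np.eye(m) / alpha      # inverse of alpha*I + sum_s z_s z_s^T
    b = np.zeros(m)                # running sum of y_s * z_s
    w = np.zeros(m)
    losses = []
    for x_t, y_t in zip(X, y):
        z = phi(x_t)[0]
        losses.append(0.5 * (w @ z - y_t) ** 2)  # loss before seeing y_t
        # Sherman-Morrison rank-one update of the inverse second-moment matrix.
        Az = A_inv @ z
        A_inv -= np.outer(Az, Az) / (1.0 + z @ Az)
        b += y_t * z
        w = A_inv @ b
    return w, np.array(losses)

# Usage: embed with 8 dictionary points, then learn online on 50 points.
rng = np.random.default_rng(0)
D = rng.normal(size=(8, 3))
phi = nystrom_map(D, gamma=0.5)
X = rng.normal(size=(50, 3))
y = np.sin(X[:, 0])
w, losses = online_ridge(X, y, phi, alpha=1.0)
```

With a fixed dictionary the embedding cannot adapt to inputs far from the dictionary points, which is the weakness the abstract attributes to fixed-size approximations.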

artificial intelligence, machine learning, pro-n-kon, (13 more...)

Neural Information Processing Systems

Country: Europe (0.46)

Genre: Instructional Material > Online (0.61)

Industry: